On Software Defect Prediction Using Machine Learning

نویسندگان

  • Jinsheng Ren
  • Ke Qin
  • Ying Ma
  • Guangchun Luo
چکیده

The goal of this paper is to catalog the software defect prediction using machine learning. Over the last few years, the eld of software defect prediction has been extensively studied because of it's crucial position in the area of software reliability maintenance, software cost estimation and software quality assurance. An insurmountable problem associated with software defect prediction is the class imbalance problem. Recently, the Asymmetric Partial Least Squares Classi er (APLSC) reported by Qu et al [1] and the Kernel Principle Component Analysis (KPCA) reported in [2] have been shown to be able to tackle the class imbalance problem. Although both the APLSC and KPCA are quite novel, they have their own disadvantages: First, the APLSC is a bilinear classi er, in which the dimension is mapped to a bilinear subspace. Second, the APLSC su ers from high overlapping, especially when the data sets are nonlinear separate. The KPCA regression model does not consider the correlation between principal components and the class attribution. In this paper, we propose two kernel based classi ers, called the Asymmetric Kernel Partial Least Squares Classi er (AKPLSC) and Asymmetric Kernel Principal Component Analysis Classi er (AKPCAC) to solve the class imbalance problem. The kernel function we use is the Gaussian kernel function. Experiments are conducted on the NASA and SOFTLAB data sets. Theoretical analysis and experimental results (using F-measure, Friedman test and Tukey test) con rm the correctness and e ectiveness of our methods. As far as we know, the results presented in this paper are novel.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Predicting Software Defects through SVM: An Empirical Approach

Software defect prediction is an important aspect of preventive maintenance of a software. Many techniques have been employed to improve software quality through defect prediction. This paper introduces an approach of defect prediction through a machine learning algorithm, support vector machines (SVM), by using the code smells as the factor. Smell prediction model based on support vector machi...

متن کامل

Benchmarking Machine Learning Technologies for Software Defect Detection

Machine Learning approaches are good in solving problems that have less information. In most cases, the software domain problems characterize as a process of learning that depend on the various circumstances and changes accordingly. A predictive model is constructed by using machine learning approaches and classified them into defective and non-defective modules. Machine learning techniques hel...

متن کامل

Comparing the Effectiveness of Machine Learning Algorithms for Defect Prediction

Software repositories with defect logs are main resource for defect prediction. In recent years, researchers have used the vast amount of data that is contained by software repositories to predict the location of defect in the code that caused problem. In this paper machine learning approach is used for predicting the modules with defect for embedded data set. Public datasets from the promise r...

متن کامل

Software defect prediction using relational association rule mining

This paper focuses on the problem of defect prediction, a problem of major importance during software maintenance and evolution. It is essential for software developers to identify defective software modules in order to continuously improve the quality of a software system. As the conditions for a software module to have defects are hard to identify, machine learning based classification models...

متن کامل

Sampling Imbalance Dataset for Software Defect Prediction Using Hybrid Neuro-fuzzy Systems with Naive Bayes Classifier

Original scientific paper Software defect prediction (SDP) is a process with difficult tasks in the case of software projects. The SDP process is useful for the identification and location of defects from the modules. This task will tend to become more costly with the addition of complex testing and evaluation mechanisms, when the software project modules size increases. Further measurement of ...

متن کامل

An Hybrid Multilayer Perceptron Using Gso-ga for Software Defect Prediction

Software defect prediction has turned into an expected requirement for organizations to guarantee quality and reliability of software products. The early defect prediction can encourage managers to amend and improve reliability of product. Methodologies, for example, machine learning and neural network have ended up as eminent solution for training and classification of data and can be importan...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • J. Applied Mathematics

دوره 2014  شماره 

صفحات  -

تاریخ انتشار 2014